NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Inducing Social Optimality in Games via Adaptive Incentive Design

https://doi.org/10.1109/cdc51059.2022.9992685

Maheshwari, Chinmay; Kulkarni, Kshitij; Wu, Manxi; Sastry, S. Shankar (December 2022, 2022 IEEE 61st Conference on Decision and Control (CDC))

How can a social planner adaptively incentivize selfish agents who are learning in a strategic environment to induce a socially optimal outcome in the long run? We propose a two-timescale learning dynamics to answer this question in games. In our learning dynamics, players adopt a class of learning rules to update their strategies at a faster timescale, while a social planner updates the incentive mechanism at a slower timescale. In particular, the update of the incentive mechanism is based on each player’s externality, which is evaluated as the difference between the player’s marginal cost and the society’s marginal cost in each time step. We show that any fixed point of our learning dynamics corresponds to the optimal incentive mechanism such that the corresponding Nash equilibrium also achieves social optimality. We also provide sufficient conditions for the learning dynamics to converge to a fixed point so that the adaptive incentive mechanism eventually induces a socially optimal outcome. Finally, as an example, we demonstrate that the sufficient conditions for convergence are satisfied in Cournot competition with finite players.
more » « less
Full Text Available
Dynamic Tolling for Inducing Socially Optimal Traffic Loads

https://doi.org/10.23919/ACC53348.2022.9867193

Maheshwari, Chinmay; Kulkarni, Kshitij; Wu, Manxi; Sastry, S Shankar (June 2022, IEEE)

Full Text Available
DEC-LOS-RRT: Decentralized Path Planning for Multi-robot Systems with Line-of-sight Constrained Communication

https://doi.org/10.1109/CCTA48906.2021.9659247

Tuck, Victoria; Pant, Yash Vardhan; Seshia, Sanjit A.; Sastry, S. Shankar (August 2021, 2021 IEEE Conference on Control Technology and Applications (CCTA))

Decentralized planning for multi-agent systems, such as fleets of robots in a search-and-rescue operation, is often constrained by limitations on how agents can communicate with each other. One such limitation is the case when agents can communicate with each other only when they are in line-of-sight (LOS). Developing decentralized planning methods that guarantee safety is difficult in this case, as agents that are occluded from each other might not be able to communicate until it’s too late to avoid a safety violation. In this paper, we develop a decentralized planning method that explicitly avoids situations where lack of visibility of other agents would lead to an unsafe situation. Building on top of an existing Rapidly exploring Random Tree (RRT)-based approach, our method guarantees safety at each iteration. Simulation studies show the effectiveness of our method and compare the degradation in performance with respect to a clairvoyant decentralized planning algorithm where agents can communicate despite not being in LOS of each other.
more » « less
Full Text Available
Combining Model-Based Design and Model-Free Policy Optimization to Learn Safe, Stabilizing Controllers

https://doi.org/10.1016/j.ifacol.2021.08.468

Westenbroek, Tyler; Agrawal, Ayush; Castañeda, Fernando; Sastry, S Shankar; Sreenath, Koushil (January 2021, IFAC-PapersOnLine)

Full Text Available
Learning Min-norm Stabilizing Control Laws for Systems with Unknown Dynamics

https://doi.org/10.1109/CDC42340.2020.9304118

Westenbroek, Tyler; Castaneda, Fernando; Agrawal, Ayush; Sastry, S. Shankar; Sreenath, Koushil (December 2020, 2020 59th IEEE Conference on Decision and Control (CDC))
null (Ed.)
Full Text Available
On Gradient-Based Learning in Continuous Games

https://doi.org/10.1137/18M1231298

Mazumdar, Eric; Ratliff, Lillian J.; Sastry, S. Shankar (January 2020, SIAM Journal on Mathematics of Data Science)

Full Text Available
Policy-Gradient Algorithms Have No Guarantees of Convergence in Linear Quadratic Games

https://doi.org/10.5555/3398761.3398862

Mazumdar, Eric; Ratliff, Lillian J.; Jordan, Michael I.; Sastry, S. Shankar (January 2020, AAMAS Conference proceedings)
null (Ed.)
Full Text Available
Improving Input-Output Linearizing Controllers for Bipedal Robots via Reinforcement Learning

Castañeda, Fernando; Wulfman, Mathias; Agrawal, Ayush; Westenbroek, Tyler; Tomlin, Claire J.; Sastry, S. Shankar; Sreenath, Koushil (June 2020, Learning for Dynamics and Control (L4DC))

The main drawbacks of input-output linearizing controllers are the need for precise dynamics models and not being able to account for input constraints. Model uncertainty is common in almost every robotic application and input saturation is present in every real world system. In this paper, we address both challenges for the specific case of bipedal robot control by the use of reinforcement learning techniques. Taking the structure of a standard input-output linearizing controller, we use an additive learned term that compensates for model uncertainty. Moreover, by adding constraints to the learning problem we manage to boost the performance of the final controller when input limits are present. We demonstrate the effectiveness of the designed framework for different levels of uncertainty on the five-link planar walking robot RABBIT.
more » « less
Full Text Available
LESS is More: Rethinking Probabilistic Models of Human Behavior

https://doi.org/10.1145/3319502.3374811

Bobu, Andreea; Scobee, Dexter R.; Fisac, Jaime F.; Sastry, S. Shankar; Dragan, Anca D. (March 2020, International Conference on Human-Robot Interaction (HRI))

Full Text Available
Gradient-based inverse risk-sensitive reinforcement learning

https://doi.org/10.1109/CDC.2017.8264535

Mazumdar, Eric; Ratliff, Lillian J.; Fiez, Tanner; Sastry, S. Shankar (December 2017, Proceedings of the 56th IEEE Conference on Decision and Control)

Full Text Available

Search for: All records